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Abstract. Causal inference with interference is a rapidly growing area. 
The literature has begun to relax the “no-interference” assumption that 
the treatment received by one individual does not affect the outcomes 
of other individuals. In this paper we briefly review the literature on 
causal inference in the presence of interference when treatments have 
been randomized. We then consider settings in which causal effects 
in the presence of interference are not identified, either because ran¬ 
domization alone does not suffice for identification or because treat¬ 
ment is not randomized and there may be unmeasured confounders 
of the treatment-outcome relationship. We develop sensitivity analysis 
techniques for these settings. We describe several sensitivity analysis 
techniques for the infectiousness effect which, in a vaccine trial, cap¬ 
tures the effect of the vaccine of one person on protecting a second 
person from infection even if the first is infected. We also develop two 
sensitivity analysis techniques for causal effects under interference in 
the presence of unmeasured confounding which generalize analogous 
techniques when interference is absent. These two techniques for un¬ 
measured confounding are compared and contrasted. 

Key words and phrases: Causal inference, infectiousness effect, inter¬ 
ference, sensitivity analysis, spillover effect, stable unit treatment value 
assumption, vaccine trial. 


1. INTRODUCTION 

Cox [(1958), page 19] wrote that there is no in¬ 
terference between different units if the observation 
on one unit is unaffected by the particular assign¬ 
ment of treatment to the other units. The assump¬ 
tion of no interference is a key component of Rubin’s 
“stable unit treatment value assumption,” called 
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SUTVA (Rubin (1986)), that is often required for 
potential outcomes to be well-defined. However, in 
many settings, the assumption of no interference ob¬ 
viously does not hold. Consider an individual who, 
if not vaccinated, would have infected another per¬ 
son, but who, if vaccinated, would not infect that 
other person. In this case, the infection outcome of 
the second person depends on the treatment of the 
first individual, and there is thus interference. Un¬ 
der the assumption of no interference, the effect of a 
treatment compares two potential outcomes the in¬ 
dividual would exhibit under treatment and control. 
With interference, an individual could have many 
potential outcomes depending on the treatments as¬ 
signed to the other individuals (Rubin, 1978, 1990). 

In some settings interference is a nuisance, while 
in other settings it creates effects of scientific, pub¬ 
lic health or social science interest. An example of 
the former includes agricultural experiments where 
treatments in neighboring plots can interfere with 
one another (Kempton (1997)). Fallow rows between 
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treatment plots can sometimes eliminate interfer¬ 
ence between plots, but more often the interference 
must be taken into account. In infectious diseases, 
interference is inherent in the biology of transmis¬ 
sion, it cannot be eliminated, and it produces in¬ 
trinsically interesting effects. Social interaction is a 
primary source of interference in studies with hu¬ 
mans subjects and often cannot be eliminated. 

Progress in causal inference with interference has 
been made recently in different contexts, including 
those in the social sciences, econometrics and in¬ 
fectious diseases. Several causal effects can be de¬ 
fined in the presence of interference, and sometimes 
similar effects have different names in different con¬ 
texts. Social scientists have long been interested in 
the effects of neighborhoods on the economic, soci¬ 
ological and psychological well-being of their inhab¬ 
itants, resulting in the term neighborhood effects 
(Sobel (2006)). The consequences of interference be¬ 
tween individuals in this context are also known as 
spillover effects. In infectious diseases, these effects 
were generally called indirect effects of interventions 
(Halloran and Struchiner (1991)). 

In Section 1.1 we present informally some exam¬ 
ples of studies on causal inference with interference 
in different contexts. In Section 2 we present formal 
definitions of direct, indirect, total and overall ef¬ 
fects as well as infectiousness effects in the presence 
of interference. In Section 3 we develop a number 
of new sensitivity analysis techniques in settings in 
which causal effects are not identified, either because 
the effect estimand itself relies on assumptions be¬ 
yond randomization or because treatment is not ran¬ 
domized and there may be unmeasured confound¬ 
ing. The sensitivity analysis techniques help address 
these issues of identification in these settings. Sec¬ 
tion 3 contains the new results of the paper and, 
as will be seen below, many of these new results 
in the context of interference build on foundational 
work on sensitivity analysis by Robins, Rotnitzky 
and Scharfstein (2000) outside the context of inter¬ 
ference. Section 4 offers some concluding remarks 
on directions for future research on interference. A 
reader who is primarily interested in the technical 
development can skip Section 1.1 and move on di¬ 
rectly to Section 2. 

1.1 Motivating Examples 

1.1.1 Interference and housing mobility Sobel 
(2006) considered interference in the Moving to Op¬ 
portunity (MTO) demonstration sponsored by the 


U.S. Department of Housing and Urban Develop¬ 
ment. In this housing mobility experiment in poor 
neighborhoods in five cities, eligible ghetto residents 
were randomly assigned to receive one of two forms 
of relocation assistance or no assistance (control). 
Sobel argued that the no interference assumption is 
not plausible for the MTO demonstration because 
many of the participants likely knew other partic¬ 
ipants at each of the five sites. Thus, the partici¬ 
pants could have influenced each other through so¬ 
cial interaction. For example, a family that decided 
to move to a new neighborhood could give rise to 
worse outcomes for a family that stayed in the orig¬ 
inal neighborhood because of the decline in social 
support for the family that stayed. 

Sobel (2006) defined causal estimands and esti¬ 
mators for indirect/spillover effects for the MTO 
randomized trial of housing vouchers, taking com¬ 
pliance into account. He assumed that interference 
could occur within the sites, but not across sites, 
which he called partial interference. He made a key 
contribution in proposing causal estimands for as¬ 
sessing effects in the presence of interference by av¬ 
eraging causal effects over all possible treatment as¬ 
signments for a particular allocation strategy com¬ 
pared to a benchmark strategy wherein no units re¬ 
ceived the treatment assignment. Although his lan¬ 
guage is different, he essentially defined causal es¬ 
timands analogous to the direct, indirect, total and 
overall effects defined in the next section. 

He then compared his causal estimands to what 
is usually estimated in studies of housing mobility 
not taking interference into account. He showed that 
what is usually estimated actually gives the differ¬ 
ence between (i) the average effect of the voucher 
on those who received them and (ii) the average ef¬ 
fect on those not receiving vouchers of having people 
leave the neighborhood. Both effects could be nega¬ 
tive (detrimental) with the difference positive, thus 
making it important to take potential interference 
into account. 

1.1.2 Interference in vaccination programs Moti¬ 
vated by an interest in the effects of vaccination and 
vaccination programs, Struchiner et al. (1990) and 
Halloran and Struchiner (1991, 1995) conceptually 
defined direct, indirect, total and overall effects in 
the presence of interference. The direct effect of a 
treatment on an individual was defined as the dif¬ 
ference between the potential outcome for that in¬ 
dividual given treatment compared to the potential 
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outcome for that individual without treatment if the 
treatment assignment in the others in the popula¬ 
tion was held fixed. In contrast to direct effects, an 
indirect effect describes the effect on an individual of 
the treatment received by others in the group when 
that individual’s treatment was held fixed. In partic¬ 
ular, the indirect effect of a treatment on an individ¬ 
ual was defined as the difference between the poten¬ 
tial outcomes for that individual without treatment 
when the group (i) receives an intervention program 
and (ii) receives a benchmark program of no inter¬ 
vention. Total effects describe the combination of 
direct and indirect effects of a particular treatment 
assignment on an individual. The total effect of a 
treatment on an individual is the difference between 
the potential outcomes for that individual (i) with 
treatment when the group receives an intervention 
program and (ii) without treatment when the group 
receives no intervention. Overall effects describe the 
average effect of an intervention relative to no inter¬ 
vention. 

Halloran and Struchiner (1995) proposed individual- 
level causal estimands of direct, indirect, total and 
overall in the presence of interference by letting the 
potential outcomes for any individual depend on the 
vector of treatment assignments to other individuals 
in the group (Rubin, 1978, 1990). However, they did 
not propose population level causal estimands. 

A number of studies have been conducted to es¬ 
timate indirect, total or overall effects of vaccina¬ 
tion programs outside of the causal inference frame¬ 
work. In the United Kingdom, the indirect effect 
of a new program of meningococcal C vaccination 
was estimated by comparing the attack rates in un¬ 
vaccinated children and adolescents before and after 
introduction of the program (Ramsay et al. (2003)). 


The United Kingdom introduced routine meningo¬ 
coccal serogroup C vaccination for infants in Novem¬ 
ber 1999. The vaccine was also offered to all chil¬ 
dren and adolescents aged <18 years in a phased 
catch-up program. Adolescents were vaccinated first 
and the program was completed by the end of 2000. 
About 75% of the children and adolescents were 
vaccinated. The attack rate in unvaccinated infants 
through adolescents per 100,000 unvaccinated pop¬ 
ulation in July 1998-June 1999 was 4.08 (95% Cl: 
3.7, 4.5) and in July 2001-June 2002 was 1.36 (95% 
Cl: 0.86, 1.85). Vaccinating about 75% of the chil¬ 
dren and adolescents thus seemed to produce an in¬ 
direct effect, with a relative reduction in the number 
of confirmed meningococcal C cases in the unvacci¬ 
nated children and adolescents, of 67% (95% Cl: 52, 
77). 

To obtain group- and population-level causal es¬ 
timands for direct, indirect, total and overall causal 
effects of treatment, Hudgens and Halloran (2008) 
proposed a two-stage randomization scheme, the 
first stage at the group level, the second at the in¬ 
dividual level within groups based on Sobel’s ap¬ 
proach of averaging over all possible treatment as¬ 
signments. As did Sobel (2006), they assumed in¬ 
terference can occur within groups but not across 
groups. The causal estimands defined by Hudgens 
and Halloran (2008) are applicable to other situa¬ 
tions with interference in fixed groups of individu¬ 
als where treatment can be assigned to individuals 
within groups. A brief formal development is given 
in Section 2. 

As an example, Hudgens and Halloran (2008) pre¬ 
sented a hypothetical two-stage randomized placebo- 
controlled trial of cholera vaccines (Table 1). Sup¬ 
pose in the first stage five geographically separate 
groups were randomized so two were assigned to 


Table 1 

Illustrative example of a two-stage randomized placebo-controlled cholera vaccine 
trial based on data from Ali et al. (2005). Group assignment corresponds to 50% or 
30% vaccine coverage (from Hudgens and Halloran (2008)) 


Group 

Group assignment 
(% vaccinated) 

Vaccine recipients 

Placebo recipients 

Total 

Cases 

Total 

Cases 

1 

50 

12,541 

16 

12,541 

18 

2 

50 

11,513 

26 

11,513 

54 

3 

30 

10,772 

17 

25,134 

119 

4 

30 

8883 

22 

20,727 

122 

5 

30 

5627 

15 

13,130 

92 
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vaccinate 50% and three were assigned to vaccinate 
30% of individuals, then individuals were randomly 
assigned to be vaccinated or not. Causal effect es¬ 
timates (and estimated variance) are given in the 
change in number of cases per 1000 individuals per 
year. The estimated indirect effect of vaccinating 
50% versus 30% in the unvaccinated individuals is 
2.81 (3.079). This suggests that vaccinating 50% of 
the population would result in 2.8 fewer cases per 
1000 unvaccinated people per year compared with 
vaccinating only 30%. Similarly, the estimated total 
effect is 4.11 (0.672). This suggests that vaccinat¬ 
ing 50% of the population would result in about 4.1 
fewer cases per 1000 vaccinated people per year com¬ 
pared with unvaccinated persons vaccinating only 
30%. The estimated overall effect is 2.37 (1.430). 
The estimated overall effect is a summary compar¬ 
ison of the two strategies, suggesting that, on aver¬ 
age, 50% vaccine coverage results in 2.4 fewer cases 
of cholera per 1000 individuals per year compared to 
30% vaccine coverage. A public health professional 
could use these estimates in evaluating the cost- 
benefit of vaccinating more people and preventing 
more cases versus vaccinating fewer people. The di¬ 
rect effect under 30% coverage is 3.64 (0.178), nearly 
three times greater than the direct effect under 50% 
coverage, which is 1.30 (0.856). The difference shows 
that even the direct effects can depend on the level 
of coverage due to interference between individuals. 
Note that the direct effect under 50% coverage of 
1.30 and the indirect effect of 2.81 sum to the total 
effect of 4.11. 

1.1.3 Interference in the context of kindergarten 
retention Hong and Raudenbush (2006) considered 
interference in the context of the effect on reading 
scores of children of being retained in kindergarten 
versus being promoted to the first grade. Interfer¬ 
ence was assumed possible through the dependence 
of the potential outcomes of reading test scores of 
one child on whether other children were retained or 
not. Hong and Raudenbush were principally inter¬ 
ested in the effect of a child’s being retained and how 
this varied with being in schools with low retention 
and versus those with high retention. They used a 
sample of data from 1080 schools with 471 kinder¬ 
garten retainees and 10,255 promoted students. In 
their application, students are clustered in schools. 
Individual treatment assignment was whether a stu¬ 
dent is retained. They used a school-level scalar 
function based on the proportion of the students 


that were retained to determine whether a school 
was a “high-retention” or “low-retention” school. 
The study was observational at two levels: schools 
were not randomized to have high or low retention, 
and students were not randomized to be retained. 
However, they framed their analysis within a two- 
stage randomization procedure similar to that de¬ 
scribed in Hudgens and Halloran (2008) in which 
both stages would have been randomized. They also 
assumed interference was possible within schools but 
not across schools. 

Using a propensity score-based approach, ac¬ 
counting for interference, and assuming that as¬ 
signment at both the school and the individ¬ 
ual level was ignorable given a number of ob¬ 
served individual-level, school-level, and school- 
aggregated-individual-level characteristics, Hong and 
Raudenbush (2006) obtained estimates of the effect 
on reading scores of retention in high-retention and 
low-retention schools. Specifically, in low-retention 
schools, they estimated the effect on reading scores 
of a student being retained versus being promoted, 
was —8.18 (95% Cl: —10.02, —6.34), and in high- 
retention schools the effect estimate was —8.86 (95% 
Cl: —11.56, —6.16). A standard deviation in read¬ 
ing test scores in this sample is 13.48 points. We 
will return to this example below to demonstrate 
sensitivity analysis in the context of interference. 

1.1.4 Interference between two sides of the face 
Rosenbaum (2007) took a different approach to 
causal inference with interference when analyzing 
randomized experiments than those in previous sec¬ 
tions. He pointed out that if Fisher’s null hypothesis 
of no effect for any individual in the population is 
true, then there is no effect and consequently no 
interference. Thus, Fisher’s permutation test of no 
effect will have the correct level, even if, under the 
alternative hypothesis, there would be interference. 
He presented several examples, including data from 
a randomized, double-blind experiment in which 15 
people received different preparations of botulinum 
A exotoxin on each side of their face to treat wrin¬ 
kles to test which was less painful. 

Rosenbaum presented exact nonparametric meth¬ 
ods for inverting randomization tests to obtain con¬ 
fidence intervals for assessing treatment effect as¬ 
suming nothing about the structure of the interfer¬ 
ence between units. He assumed that there were a 
number of blocks (groups) with a number of individ¬ 
uals within each group, some of which, but not all, 
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were randomized to a treatment, the others to con¬ 
trol. He developed a general notation that allowed 
interference across blocks and did not assume a two- 
stage randomization. Rosenbaum (2007) differenti¬ 
ated two null hypotheses. The first null hypothesis 
is that treatment has no primary effect, that is, the 
response of each unit does not vary under differ¬ 
ent randomization assignments in the collection of 
the possible assignment matrices with fixed number 
randomized in each block. Analogous to the bench¬ 
mark allocation of Sobel (2006) and the two-stage 
randomized trials described above where possibly 
some communities receive only the control interven¬ 
tion, Rosenbaum (2007) invoked uniformity trials 
in which individuals within treatment groups would 
be randomly assigned to treatment and control, but 
everyone in control groups would receive just con¬ 
trol. The second null hypothesis is that treatment 
has no effect, that is, under different randomiza¬ 
tion assignments in the collection of the possible as¬ 
signment matrices with fixed number randomized in 
each block, each individual’s response equals his re¬ 
sponse in a uniformity trial. If there is no effect, then 
no benefit is gained from receiving the treatment. If 
there is no primary effect, there is no advantage to 
being one of the treated individuals, but the benefits 
could be shared by all of the individuals. 

Rosenbaum gave conditions using distribution- 
free tests in which without performing the unifor¬ 
mity trials, he was able to get confidence statements 
about the magnitude of the effect and/or primary ef¬ 
fect, though not able to distinguish between them. 
In the botox example, the 15 people are the blocks, 
the two sides of the face the individuals. All 15 peo¬ 
ple reported less pain from the treatment contain¬ 
ing alcohol. Using his method, the hypotheses of no 
effect and no primary effect were rejected with a 
one-sided significance level 0.000031. 

Luo et al. (2012) extended this approach in the 
context of a cognitive neuroscience experiment in 
which the brains of a moderate number of sub¬ 
jects are studied using functional magnetic reso¬ 
nance imaging while challenged with a rapid fire 
sequence of randomized stimuli. Interference was as¬ 
sumed to occur between units of time in the same 
individual. 

1.1.5 Interference and infectiousness effects In 
vaccine contexts, a vaccinated person who becomes 
infected might have a lower probability of transmit¬ 
ting to a susceptible person during a contact than an 


unvaccinated person who becomes infected. This is 
called the effect of the vaccine on infectiousness. In a 
study in Niakhar, Senegal, for example, Preziosi and 
Halloran (2003) estimated the relative reduction in 
infectiousness to household contacts of a vaccinated 
case of pertussis compared to an unvaccinated case 
to be 67% (95% Cl: 29, 86). Estimating reduction in 
infectiousness can be of considerable public health 
interest, particularly with vaccines that do not pro¬ 
tect well against infection. 

Developing general methods for causal inference 
for infectiousness effects poses complicated chal¬ 
lenges. Even if the vaccine is randomized, the in¬ 
fectiousness effect is measured only in people who 
become infected, a post-randomization variable, so 
the estimate would in general be subject to selection 
bias. VanderWeele and Tchetgen Tchetgen (2011b) 
and Halloran and Hudgens (2012a, 2012b) proposed 
causal quantities corresponding to the infectious¬ 
ness effect in the simple situation of households of 
size two. The general approach combines causal in¬ 
ference with interference with principal stratifica¬ 
tion (Frangakis and Rubin (2002)). The latter ac¬ 
counts for the fact that the comparison in the groups 
who become infected may be subject to selection 
bias. The causal infectiousness effect is not identifi¬ 
able without further assumptions. In Section 2.6 we 
present the bounds that were developed previously. 
In Section 3.2 we present new results for sensitivity 
analyses for causal infectiousness effects. 

1.1.6 Other approaches Manski (2013) studied 
identification of potential outcome distributions 
when treatment response may have social interac¬ 
tions. He called the no interference assumption the 
individualistic treatment response to differentiate it 
from other forms of treatment response that depend 
on social interaction. VanderWeele et al. (2012) dis¬ 
cussed the relation between causal interactions and 
interference and how under randomization it is pos¬ 
sible to test for specific forms of interference. They 
show that the theory for causal interactions provides 
a conceptual apparatus for assessing interference as 
well. 

2. FORMALIZATION 

In this section we present previously developed 
formalizations of the direct, indirect, total and over¬ 
all effects as well as the infectiousness effects as 
background for the development of the new sensi¬ 
tivity analyses under interference in Section 3. 
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2.1 Notation 

Suppose there are N > 1 groups of individu¬ 
als or blocks of units. For i = let n* de¬ 

note the number of individuals in group i and let 
Z i = (Zn ,..., Zi ni ) denote the treatments those n* 
individuals receive. Assume Z^ is a dichotomous 
random variable having values 0 or 1 such that 
Z, can take on 2 ni possible values. Let Z^ de¬ 
note the rii — 1 subvector of Zj with the jth en¬ 
try deleted. The vector Zj is referred to as an in¬ 
tervention or treatment program , to distinguish it 
from the individual treatment Z^. Let Zj and z^ 
denote possible values of Zj and Z^. Define R J 
to be the set of vectors of possible treatment pro¬ 
grams of length j, for j = 1,2,, m. For example, 
R 2 = {(0,0), (0,1), (1,0), (1,1)}. 

Denote the potential outcome of individual j in 
group i under treatment zj as Ijj(zj). Denote Yj(zj) 
as the vector of such outcomes under treatment Zj 
for group i. The notation Yjj(zi) allows for the pos¬ 
sibility that the potential outcome for the individ¬ 
ual j may depend on another individual’s treatment 
assignment in group i, that is, it allows for inter¬ 
ference between individuals within a group. The 
Yij(zj) potential responses can be assumed hxed, 
since they do not depend on the realized random 
assignment of treatments Zj, whereas the observed 
responses Yjj(Zj) do depend on Zj and thus are 
random variables. We also consider potential out¬ 
comes Yj(zj) that are independent and identically 
distributed across blocks. Partial interference is as¬ 
sumed to hold, that is, the outcome of one individual 
can depend on treatment of other individuals in the 
same block, but not those in different blocks. The 
form of the interference within groups is assumed 
unknown and can be of arbitrary form. 

2.2 Treatment Assignment Mechanisms 

Following Hudgens and Halloran (2008), consider 
a two-stage randomization scheme, the first stage 
at the group level, the second at the individual 
level within groups. Let ip and cp denote pararn- 
eterizations that govern the distribution of Zj for 
i = 1,..., N. Corresponding to the first stage of ran¬ 
domization, let S = (Si,..., Sn) denote the group 
assignments with Sj = 1 if the group is assigned to 
ip and 0 if assigned to (p. Let v denote the param¬ 
eterization that governs the distribution of S and 
let C = Yhi Si denote the number of groups assigned 
ip. Following Sobel (2006), Hudgens and Halloran 


(2008) focused on a mixed group and mixed indi¬ 
vidual assignment strategy, whereby a fixed num¬ 
ber of groups were allocated to ip, and within each 
group, a fixed number of individuals were allocated 
to treatment versus control. VanderWeele and Tch- 
etgen Tchetgen (2011a) and Tchetgen Tchetgen and 
VanderWeele (2012) considered what we call a sim¬ 
ple randomization scheme whereby treatment is ran¬ 
domly assigned to different individuals within group 
i according to a Bernoulli probability mass function. 
The causal estimands defined below have the same 
form under either randomization scheme, though the 
different randomization schemes result in subtle dif¬ 
ferences of interpretation. 

2.3 Average Potential Outcomes 

Causal estimands are typically defined in terms 
of averages of potential outcomes which are identi¬ 
fiable from observable random variables. Following 
this approach, the potential outcomes for individual 
j in group i under = z can be written 

(!) Y ij( z i(j),Zij = z), 

for z = 0,1. Because (1) depends on zj^, following 
Sobel (2006), Hudgens and Halloran (2008) defined 
the individual average potential outcome for individ¬ 
ual j in group i under z^ = z by 

Yij {z; ip) = ST Yij (z i(i) = u>, = z) 

wGjOjl }"*- 1 

P^V(Zj(j) U}\Zij z ). 

In other words, the individual average potential out¬ 
come is the conditional expectation of Y tJ (Z t ) given 
Z^ = 1 under assignment strategy ip. In contrast, 
under the simple allocation strategy of VanderWeele 
and Tchetgen Tchetgen (2011a), the potential out¬ 
comes are averaged over the unconditional distri¬ 
bution of Zj (j). Averaging over individuals, define 
the group average potential outcome under treat¬ 
ment assignment z as Yi(z-,ip) = Yl l n j t =i Y ij{ z 'i' l P)/ n i- 
Finally, averaging over groups, define the population 
average potential outcome under treatment assign¬ 
ment z as Y(z-,ip) = Yi(z;ip)/N. The causal 
estimands in the next section are defined in terms 
of the individual, group and population average po¬ 
tential outcomes. The individual estimands were de¬ 
fined in Halloran and Struchiner (1995), the individ¬ 
ual average, group average and population average 
estimands in Hudgens and Halloran (2008). 
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2.4 Direct, Indirect, Total and Overall Causal 
Effects 

The individual direct causal effects of treatment 
0 compared to treatment 1 for the individual j in 
group i were defined by 

EE ij { z i(j)) — Yij( z i(j)iZij = 1) 

( 2 ) 

— Y iji z i[j)i Zij = 0 ). 

The individual average direct causal effect for the 
jth individual in the 1th group was defined by 

(3) aeg(^)=F^(l;V>)-F^(0;^), 

that is, the difference in individual average potential 
outcomes when Zij = 1 and when z^j = 0 under if. 
The group average direct causal effect as defined by 

CEP(if) =F 4 (1;^) -F^) = 

and the population average direct causal effect by 
CE°(if) = F(l; if) - F(0; if) = E^ CE?(if)/N. 

The individual indirect causal effects of treatment 
program z compared with z ’ on individual j in group 
i were defined by 

CE % ( z iU), ) - Y C Hj ) > z a = °) 

(4) 

-F(z' 0) ,4 = °), 

where z' is another n* dimensional vector of treat¬ 
ment random variables. (Note z 1 does not denote 
the transpose of z.) Similar to direct effects, the 
individual average indirect causal effect were de¬ 
fined by CE^^if) = Yij( 0; (f) -Y^fkif). Clearly, 

if if = <f>. then CE^fcf,ip) = 0, that is, there will 
be no individual average indirect causal effects. 
Finally, the group average indirect causal effect 
was defined as CE\(cf,if ) = Yj(0;</>) — Yj(0; 'if) = 

E”h i CEtj (f, if) / n t and the population average 
indirect causal effect as CE 1 (<f,if) = Y(0;j!>) — 

y(0-,if) = EhCEl^^)/N. 

The individual total causal effects for individual j 
in group i were defined as 

CE Tji z i(j), z i(j)) = Y '.f z H.i)-u = F 

(5) 

-Fi(z' 0) ,4=°)- 

The individual average total causal effect was de¬ 
fined by CE?((f,if) = Yij(l-(j)) - Yij(0-,ip), the 
group average total causal effect was defined by 

CEj (<f, 1>) = Fi(l; cf>) -Fj(0; f>) = E £* CE? (<i>, ^)/ 


Hi, and the population average total causal effect 
was defined by CE T (cf,if) = Y(l;<f) — Y(0;^) = 

Eili ^/N- It follows by simple addition 

and subtraction that a total effect is the sum of the 
direct and indirect effects at the individual, indi¬ 
vidual average, group average and population av¬ 
erage levels. For example, CE (4>,if) = Y(\\<f>) — 

F(0; VO = F(i; VO - F(o ; </>) + F(o ; 4>) - F(o ; if) = 

CE? ((f) + CE\(f,if). 

The overall causal effect was defined to be the av¬ 
erage effect of an intervention program relative to 
no intervention. The individual overall causal effect 
of treatment z, compared to treatment z '■ for indi¬ 
vidual j in group i was defined by CE? (z,;, z() = 
Yjj(zi) — Yijfzlf). Similarly, for the comparison of 
cf to if, the individual average overall causal ef¬ 
fect was defined by CE?j((f,if) = Y\j((f) — Y ij(if ), 

the group overall causal effect by CE?(cf,if) = 
Yi{4>) ~ Yj('(/;), and the population overall causal ef¬ 
fect by CE° (<f, if) = Y(cf) — Y(if) where Y (VO = 
EiliEW/^ and Yi(if) = YljL\Yij{if)/rii and 

Y ijW = E u£ {0,l}"i Y ij( z i = <^)P r V>( z i = ^)- Van - 
derWeele and Tchetgen Tchetgen (2011a) showed 
that the overall effect decomposes into the sum 
of an indirect effect and a contrast of two di¬ 
rect effects on the individual average, group av¬ 
erage and population average levels. For example, 
CE°(<f,if) = CE\cf,if) + {CE°(f)PrffZij = 1) - 
CE° (VOPr^i = !)}• 

The quantities defined above under interference 
have two important distinctions from those used 
in causal inference without interference. First, they 
quantify causal effects only for participants in the 
randomized study. Second, they depend on the 
randomization probabilities [through Pr ^(Z^ = 
OJ I Zij — z)\. Although the causal estimands here de¬ 
pend on the assignment mechanism (e.g., comparing 
two different proportions vaccinated), we could al¬ 
ternatively compare allocation strategies of always 
vaccinate versus never vaccinate to recover tradi¬ 
tional causal estimands that do not depend on the 
assignment mechanism. 

The estimands defined above simplify under the 
assumption of no interference between individuals 
within a group since the potential outcomes of the 
jth individual in group i can be written as Y: j (1) and 
Yjj(0). In turn, the individual direct causal effect is 
no longer dependent on the treatment assignment 
vector z i(j) and simply equals Yj(l) — Y ij{ 0)- The 
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corresponding group average direct causal effect be¬ 
comes X]j=i{^d'(l) — ^(0)}/rij, that is, the usual 
average causal effect estimand. By (4), the individ¬ 
ual indirect causal effect equals zero for all individ¬ 
uals assuming no interference. That is, assuming no 
interference implies the treatment has no indirect 
effects. Similarly, by (2) the individual total causal 
effect equals the individual direct causal effect. Like¬ 
wise, at the group average level, under the no inter¬ 
ference assumption the indirect causal effect is zero 
and the direct causal effect equals the total causal 
effect. 

2.5 Inference and Challenges 

Assuming the two-stage randomization and mixed 
allocation strategy, Hudgens and Halloran (2008) 
proposed unbiased estimators for the various pop¬ 
ulation average effects. They provided variance es¬ 
timates under the assumption of stratified infer¬ 
ence, that is, if it matters only how many people 
are allocated to treatment, not exactly which ones. 
Tchetgen Tchetgen and VanderWeele (2012) pro¬ 
vided conservative variance estimators (i.e., guar¬ 
anteed to be no smaller than the true variance in 
expectation), under more general assumptions and 
provided finite sample confidence intervals for the 
various effects without the assumption of stratified 
interference. Liu and Hudgens (2014) further devel¬ 
oped large sample randomization inference for the 
direct, indirect, total and overall causal effects in 
the presence of interference when either the number 
of groups or the number of individuals within groups 
grows large, but not necessarily both. 

2.6 Interference and Infectiousness Effects 

To develop causal estimands for the infectious¬ 
ness effects presented in Section 1.1.5, we follow 
the development of VanderWeele and Tchetgen Tch¬ 
etgen (2011b) and Halloran and Hudgens (2012a). 
Consider a setting with N households (groups) in¬ 
dexed by i = 1,..., N. Each household consists of 
two persons indexed by j = 1,2. We let Z^ denote 
the vaccine status for individual j in household i, 
where Z^ = 1 if the individual received vaccine and 
Zij = 0 if the individual did not. For each house¬ 
hold, Zj = ( Zn,Zi 2 ) denotes the vaccine status of 
the two individuals in the household. We let Vj de¬ 
note the infection status of individual j in house¬ 
hold i after some suitable follow-up in the study. 
We let Yij(zu, Ziz) denote the potential outcome for 
individual j in household i if the two individuals in 


that household i had vaccine status of (zn , z^); we 
treat the potential outcome vector Yj(zn, z^) as a 
random variable that is independent and identically 
distributed across households. 

We assume partial interference, that is, the expo¬ 
sure status of persons in one household in the study 
do not affect the outcomes of individuals in other 
study households. The assumption that clusters con¬ 
stitute isolated pairs would be reasonable in a vac¬ 
cine trial conducted with a relatively small number 
of households in a very large city so that it is un¬ 
likely that the various households in the study would 
interact with one another. We will assume that the 
two individuals in each household are distinguish¬ 
able (e.g., a husband and wife pair) and we will con¬ 
sider a simple randomized experiment in which only 
one of the two individuals (e.g., the wife) is prede¬ 
termined to be randomized to receive a vaccine or 
control and the second person (e.g., the husband) 
is predetermined to be always unvaccinated. We let 
j = 1 denote the individual who may or may not be 
vaccinated (e.g., the wife) and j = 2 the individual 
who is always unvaccinated (e.g., the husband). In 
other settings in which the individual (husband or 
wife) who is subject to vaccination is itself random¬ 
ized (i.e., two-stage randomization), the analysis be¬ 
low could be done separately in those households in 
which the wife was selected for vaccine randomiza¬ 
tion versus those in which the husband was selected. 

The crude (or net) estimator for the infectiousness 
effect on the risk difference scale was defined as 

E[Yi2\Zn = l,Y n = 1] 

( 6 ) 

— E[Yi2\Zn — o,y)i — 1], 

where the expectation is taken over all households. 
This is a comparison of the infection rates for indi¬ 
vidual 2 in the subgroup in which individual 1 was 
vaccinated and infected versus in the subgroup in 
which individual 1 was unvaccinated and infected. 
Even though the vaccine status for individual 1 is 
randomized, conditioning on a variable that occurs 
after treatment, for example, the infection status of 
individual 1, in effect breaks randomization. The net 
estimator for the infectiousness effect could be sub¬ 
ject to selection bias. We are computing infection 
rates for individual 2 for subpopulations that may 
be quite different with respect to individual 1. 

Consider a second contrast proposed by Vander¬ 
Weele and Tchetgen Tchetgen(2011b) and Halloran 
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and Hudgens (2012a): 

£7[Yb(l,0)-y i2 (0,0)| 

( 7 ) 

y i i(i,o) = y i i(o,o) = i]. 

This contrast compares the infection status for in¬ 
dividual 2 if individual 1 was vaccinated, Fj 2 (l,0), 
versus unvaccinated, Fj 2 (0,0), but only among the 
subset of households for whom individual 1 would 
have been infected irrespective of whether individual 
1 was vaccinated, that is, Fji(l,0) = y 1 (0,0) = 1. 
Such a subgroup is sometimes called a principal stra¬ 
tum (Frangakis and Rubin (2002)). The contrast in 
(7) is not subject to selection bias, so it can be con¬ 
sidered a formal causal contrast for the infectious¬ 
ness effect. 

Unfortunately, we do not know which households 
fall into the subpopulation in which individual 1 
would have been infected irrespective of whether in¬ 
dividual 1 was vaccinated. The contrast (7) is, in 
general, unidentified, even when treatment is ran¬ 
domized, though the observable data do provide 
some information about (7). Bounds and sensitivity 
analysis are further facilitated by other assumptions. 

Assumption 1. For all i, (1,0) < Fa (0,0). 

Assumption 1, usually called a monotonicity as¬ 
sumption, states there is no one who would be in¬ 
fected if vaccinated but uninfected if unvaccinated. 
Under Assumption 1, there are three principal strata 
or subgroups of households defined by the joint po¬ 
tential infection outcomes of individual 1 under vac¬ 
cine and control. They are (i) the doomed principal 
stratum in which individual 1 is infected whether 
vaccinated or not, (ii) the protected stratum in 
which individual 1 is infected if unvaccinated and 
uninfected if vaccinated, and (iii) the immune stra¬ 
tum, in which individual 1 does not become infected 
whether vaccinated or not. The causal contrast (7) 
is defined in the doomed principal stratum. 

To simplify notation, let p v = .E[Fj 2 (l, 0)| 

y a (i,o) = y a (o,o) = i], p u = f?[y i2 (o,o)|y i i(i,o) = 

Fa (0,0) = 1], pi = E[Y i2 \Zii = l,Yn = 1] and p 0 = 
E[Yi 2 \Zn =0,Fji = 1], The crude (net) infectious¬ 
ness effect (6) is then just p\ — po, and the causal 
infectiousness effect (7) is p v — p u . Under random¬ 
ization and monotonicity, any household where in¬ 
dividual 1 is infected if vaccinated must be in the 
doomed stratum, so p v = p\- Thus, one component 
of the causal infectiousness effect (7) is identified. 

However, any household where individual 1 be¬ 
comes infected if unvaccinated could be in the 


doomed or protected stratum. Thus, p u is not iden¬ 
tified without further assumptions. However, under 
monotonicity, the ratio p of the proportion in the 
protected stratum to the sum of the proportions in 
the protected and doomed strata is identified by the 
observed data. Thus, we know what proportion of 
the households in which individual 1 received con¬ 
trol and was infected is in the doomed stratum, just 
not which ones, so we do not know what proportion 
of secondary transmissions occurred in the doomed 
strata. Under Assumption 1, Halloran and Hudgens 
(2012a, 2012b) derived upper and lower bounds for 
causal effects on infectiousness that are constrained 
by the relation in the data between p and pq. 

A further possible assumption is the following: 

Assumption 2. A[F* 2 (0,0)|Z ?; i = 0, F;i = 1] < 

£[y 2 (0,0)|Z a = l,Fji = 1], 

Assumption 2 states that the average infection 
rate for individual 2 if both individuals 1 and 2 
were unvaccinated would be lower in the subgroup of 
households for which individual 1 would be infected 
and unvaccinated than in the subgroup of house¬ 
holds for which individual 1 would be infected and 
vaccinated. The assumption might be thought plau¬ 
sible insofar as the subgroup for which individual 1 
was vaccinated and infected might be less healthy 
than the subgroup for which individual 1 was un¬ 
vaccinated and infected; thus, if both people are un¬ 
vaccinated, individual 2 is more likely to be infected 
in the first subgroup than in the second. 

Under Assumptions 1 and 2, VanderWeele and 
Tchetgen Tchetgen (2011b) showed the crude con¬ 
trast in (6) is conservative for the causal con¬ 
trast in (7) in that A[F) 2 (1,0) — Fj 2 (0,0)|F)i(l,0) = 
Fa(0,0) = 1] < E[Y i2 \Za = l,F a = 1] - E^Zn = 
0, Fji = 1], that is, p v — p u <Pi — Po- Analogous re¬ 
sults in fact also hold for the risk ratio, odds ratio 
and vaccine efficacy scales (VanderWeele and Tch¬ 
etgen Tchetgen (2011b)). 

The approach may be employed outside of the vac¬ 
cine context. For example, in an observational study 
in which the treatment is a smoking cessation pro¬ 
gram in which one of two persons in a household 
participated. The participation of the first person 
might affect the smoking behavior of the second. 
This might occur either (i) because smoking cessa¬ 
tion for the first person encourages the second to 
stop smoking or because (ii) even if the first person 
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does not stop smoking, the second person might nev¬ 
ertheless be exposed to some of the smoking cessa¬ 
tion program materials. One could evaluate this sec¬ 
ond type of effect (the analogue of the infectiousness 
effect) by applying the approach described above. 

3. INTERFERENCE AND SENSITIVITY 
ANALYSIS 

3.1 Overview 

In this section we develop sensitivity analysis tech¬ 
niques that can help assess the presence of causal ef¬ 
fects in two settings where these effects in the pres¬ 
ence of interference are not identified. These causal 
effects may not be identified either because the 
treatments are not randomized or because, even if 
the treatments are randomized, the spillover effects 
of interest involve conditioning on a post-treatment 
variable thereby breaking randomization. Building 
on the previous sections, we first consider the setting 
of a randomized trial where the spillover effect of 
interest is not identified by randomization alone be¬ 
cause of conditioning on a post-randomization vari¬ 
able as in the infectiousness effect described in Sec¬ 
tion 2.6. We present sensitivity analysis methods for 
assessing this infectiousness effect in part by adapt¬ 
ing research in Robins, Rotnitzky and Scharfstein 
(2000). We then consider the setting of observational 
data such as in Hong and Raudenbush (2006) in 
which causal effects and spillover effects may not 
be identified due to one or more unmeasured con¬ 
founding variables. We present two sensitivity anal¬ 
ysis techniques for causal effects in the presence of 
interference that extend analogous results for causal 
effects under no-interference (Robins, Rotnitzky and 
Scharfstein (2000); VanderWeele and Arah (2011)) 
to the setting of causal effects and spillover effects 
in the presence of interference. 

3.2 Sensitivity Analysis for the Infectiousness 
Effect 

In Section 2.6 we described two previously de¬ 
veloped approaches to bounds on the infectious¬ 
ness effects. Here we develop methods for sensi¬ 
tivity analysis for the infectiousness effect. We fol¬ 
low the development first in VanderWeele and Tch- 
etgen Tchetgen (2011b) and Halloran and Hud¬ 
gens (2012a) and then in Hudgens and Halloran 
(2006); further technical development is given in the 
Appendix. See also VanderWeele and Tchetgen Tch¬ 
etgen (2011b) and Hudgens and Halloran (2006) for 


concrete applications. A simple sensitivity analysis 
approach also follows from the development of Van¬ 
derWeele and Tchetgen Tchetgen (2011b). We use 
the same notation as in Section 2.6. As noted in Sec¬ 
tion 2.6, under monotonicity Assumption 1, we have 
that p v = pi and, thus, to obtain the causal infec¬ 
tiousness effect, we need to express p u in terms of the 
observed data and sensitivity analysis parameters. 
We will describe three different parameterizations. 

First, let 9 = E[Y i2 (0,0)\Zn = 1,Vi = 1] - 
E[Y i2 (0,0)\Z a = 0,l)i = 1] denote the sensitivity 
parameter which contrasts the average counterfac- 
tual infection rates for individual 2 if both individ¬ 
uals 1 and 2 were unvaccinated in the subgroup of 
households for which individual 1 is vaccinated and 
infected versus the subgroup of households for which 
individual 1 is unvaccinated and infected. It follows 
from the development in VanderWeele and Tchet¬ 
gen Tchetgen that, under monotonicity, p u =po + 9 
and, thus, 

Pv-Pu=Pi~Po- 0. 

In other words, to obtain the infectiousness effect 
under monotonicity, we can calculate the crude in¬ 
fectiousness effect in ( 6 ), specify the sensitivity pa¬ 
rameter 9 and subtract the sensitivity parameter 9 
from the crude estimate to obtain the infectiousness 
effect. We can vary 9 over a range of plausible values 
in a sensitivity analysis to produce a range of plausi¬ 
ble values for the infectiousness effect. The sensitiv¬ 
ity analysis parameter is subject to certain empiri¬ 
cal constraints as described below. However, because 
of the simple relationship above, a corrected confi¬ 
dence interval under sensitivity parameter 9 can be 
obtained simply by subtracting 9 from both limits of 
the confidence interval for the crude estimate in ( 6 ). 

We can also use a similar approach but with a 
different parameterization of the sensitivity anal¬ 
ysis parameters. Following Hudgens and Hallo¬ 
ran (2006), we can vary 7 = -F[V 2 ( 0 , 0 )|Vi( 1 > 0 ) = 
0, V;i(0,0) = l] 5 the probability of secondary trans¬ 
mission in the protected stratum when individual 
l receives control with bounds set by constraints 
of the data (Halloran and Hudgens, 2012a, 2012b). 
The quantity 7 is not identifiable from the observed 
data without further assumptions, but once a value 
of 7 is assumed, then the probability of secondary 
transmission in the doomed stratum is fixed and, 
thus, p u is identified. Varying 7 , the infectiousness 
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effect (7) on the risk difference scale can be obtained 
as pi — p u , where p u is given by 


Pu = 


Po ~ 7 (! - P) 


P 


and where 7 can vary between 
Po - P 


max< 0 , 


1 ~P 


< 7 < min< 1 , 


Po 

1 ~ P 


with the left side giving rise to the upper bound for 
the causal infectiousness effect and the right side 
giving rise to the lower bound on the risk difference 
scale. Note that a drawback of this approach is that 
the parameter 7 is constrained by the observed data 
and similar restrictions are imposed on 6 above by 
virtue of the identity 6 = po — po ~ 7 ^ 1 ~ p ^ . 

As a third parameterization, we could follow an 
approach to sensitivity analysis developed in Scharf- 
stein, Rotnitzky and Robins (1999) and Robins, 
Rotnitzky and Scharfstein (2000). This approach 
performs sensitivity analysis with a bias parameter 
/? on the ratio scale. Gilbert, Bosch and Hudgens 
(2003) and Hudgens and Halloran (2006) adapted 
this approach for sensitivity analysis for causal ef¬ 
fects on post-infection outcomes, the former in the 
continuous post-infection outcome scenario, the lat¬ 
ter for binary post-infection outcomes. Hudgens and 
Halloran (2006) and Halloran and Hudgens (2012a) 
suggested that this approach to sensitivity analysis 
could be used for infectiousness effects taking as the 
intermediate infection outcome the infection status 
of individual 1 and as the potential post-infection 
outcome the infection status of individual 2. Within 
the context of the infectiousness effect, again under 
monotonicity Assumption 1 , the bias parameter (3 
can be expressed as 


exp(/3) 

= (P(y i2 (0,0) = l\Zn = l,Y n = 1) 

/P(Xi 2 (0,0) = 0\Z il = l,Yn = 1)) 

/(P{Y i2 ( 0,0) = 1|W (0,0) = 1 , Y a ( 1,0) = 0} 
/P{Y i2 ( 0,0) = 01^1(0,0) = l,17i(l,0) = 0}). 


The bias parameter /3 is the log of odds ratio com¬ 
paring the risk of infection if individual 1 is not 
vaccinated among (i) the doomed stratum and (ii) 
the protected. Note this is not simply a different 
scale than the bias parameter 6 above (odds ra¬ 
tio versus risk difference) but also a comparison 


of different subpopulations. Once this bias param¬ 
eter is specified, then it can be shown that p u = 
E[Y i2 (0,0)\Y n (l,0) =17i(0,0) = 1] is the positive 
root of (— 6 ± Vb 2 — 4:zc)/2z, where 


£ = exp(/3)p 0 , 


b 


( P(Yn = l\Zn = 1)\ 
V P(Y il = l\Z il = l)J 

- exp(/3)p 0 +Po ~ exp(/3) 


(exp(/3) - 1} 


c = 


p(jii = i\Zii = ri 

P(Y, a = 1| Z a = 1) 


{exp(/3) - 1}. 


For further discussion of inference for this approach 
to sensitivity analysis see Hudgens and Halloran 
(2006) and Jerniai et al. (2007). 

In each of the three parameterizations above, 
once we have obtained p u = E[Yi 2 (0, 0)|Y)i (1,0) = 
171 ( 0 , 0 ) = 1 ], we can obtain the infectiousness ef¬ 
fect on the difference, risk ratio, odds ratio or in¬ 
fectiousness effect scales by p v — p u , p v /p u , Pv{ 1 — 
Pu)/{Pu{^-~ Pv)} and 1 -Pv/Pu, respectively. 

We have focused here on the setting of a random¬ 
ized trial, but the approach is potentially applicable 
to observational studies as well if, conditional on 
some set of covariates C , the treatment was jointly 
independent of the counterfactual outcomes (i.e., ef¬ 
fectively randomized within strata of C). The sensi¬ 
tivity analysis parameters would have to be condi¬ 
tional on C. 


3.3 Sensitivity Analysis for Spillover Effects 

Under Unmeasured Confounding: 

Approach 1 

We now consider a setting in which causal ef¬ 
fects and spillover effects under interference are not 
identified due to unmeasured confounding. Adjust¬ 
ment is often made for covariates to attempt to con¬ 
trol for such confounding. However, in an observa¬ 
tional study we can never be sure that the control 
is adequate. One or more unmeasured confounders 
may bias effect estimates. Confounding control be¬ 
comes even more complex in settings with interfer¬ 
ence since when one individual’s outcome is under 
consideration, control will often need to be made for 
the covariates of other individuals in the same clus¬ 
ter (Tchetgen Tchetgen and VanderWeele (2012); 
Ogburn and VanderWeele (2012); Perez-Heydrich 
et al. (2013)). Unmeasured confounding can thus 
operate either through the unmeasured covariates 
for the focal individual or for other individuals in 
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the same cluster. In this subsection, we apply and 
extend the sensitivity analysis approach of Vander- 
Weele and Arah (2011) to allow for settings with 
interference and spillover effects. In the next sub¬ 
section we consider an extension of the sensitivity 
analysis approach of Robins, Rotnitzky and Scharf- 
stein ( 2000 ) to allow for interference and spillover 
effects. 

We consider a general observational setting such 
as that employed by Hong and Raudenbush (2006) 
wherein individuals are clustered in groups such 
that individuals within groups may influence one 
another but there is no interference between groups. 
We make the stratified interference assumption 
above and further assume, following Hong and 
Raudenbush, that the potential outcome of per¬ 
son j, Yij(zi), depends on the treatment received 
by the individuals in cluster i other than person j, 
Zj(j\. only through some known many-to-one scalar 
function g(z^) so that Y^-(zj) can be written as 
Yij(zij,g( Zj(j))). For example, g(z , L might be the 
mean of z^jy Let Gij = g(7iyy). Suppose that for 
all i,j, Zij is determined by simple randomization. 
We then have that 

E[Y(z,g)\Z = z,G = g]=E[Y(z,g)}. 

Hong and Raudenbush (2006) considered a variation 
on this assumption in the context of observational 
data. Specifically, for some covariate vector , they 
assumed that 


( 8 ) 


E[Y(z,g)\Z = z,G = g,L = l\ 
= E[Y(z,g)\L = l} 


and from this it follows that 


E[Y{z,g)\L = l} = E[Y\Z = z,G = g,L = l\, 

where the right-hand side can be estimated with ob¬ 
served data. Hong and Raudenbush (2006) also al¬ 
lowed Lij to contain cluster level covariates along 
with cluster aggregates of individual level covari¬ 
ates. Note, however, that ( 8 ) requires that Yij(z,g) 
be mean independent of both Z tJ and g(Zyy) condi¬ 
tional on Lij. If, for each individual, Zy is random¬ 
ized conditional on , although this will imply that 
Yij(z,g ) is mean independent of Zj 3 conditional on 
L^, it does not necessarily guarantee that Yij(z,g) 
is mean independent of g(Zyj^) conditional on Ljj . 
Let L i(j\ denote the vector of all covariates L^ for 


all individuals in cluster i other than individual j. 
We might, instead of (8), consider 

E[Y(z,g)\Z = z,G = g,L = l, h( L) = h] 

(9) 

= E[Y(z,g)\L = l,h(L) = h], 

where h(Lyy) is a known function of Lyy. However, 
once again, with (9), even if, for each individual, Zy 
were randomized conditional on Ljj, hiLyy), this 
would not guarantee that Yy{z,g) is mean inde¬ 
pendent of g(Zj(j)) conditional on Ly, h(L i jj' ) ) un¬ 
less /i(LjQ)) = L i(jy See Ogburn and VanderWeele 
(2012) for discussion of causal structures for which 
assumptions (8) or (9) will hold. Under assumption 

(9) , we have 

E[Y(z,g)\L = l,h(L) = h] 

= E[Y\Z = z,G = g,L = l, h( L) = h], 

where again the right-hand side can be estimated 
with observed data. From this one could obtain con¬ 
ditional direct, indirect and total effects, namely, 

E[Y{z,g)\l,h]- E[Y(z' ,g)\l,h], 

E[Y(z,g)\l,h] — E[Y(z,g')\l,h\, 

E[Y{z,g)\l,h)-E[Y{z',g')\l,h]. 

These contrasts are important insofar as they allow 
one to assess the relative importance for an individ¬ 
ual’s outcome of changing an individual’s own treat¬ 
ment versus the treatment of other individuals. In 
other words, the effects allow one to assess the rela¬ 
tive importance of spillover. Marginal effects, involv¬ 
ing counterfactuals of the form E[Y(z,g)\, could be 
obtained by averaging over the distributions of Ljj 
and /i(Lj(j)). 

Suppose now that we have unmeasured confound¬ 
ing by one or more unmeasured confounders Uij and 
let Ujfj) denote the vector of U l3 for all individuals 
in cluster i other than individual j. Suppose that 
the analogue of assumption (9) holds conditional on 
observed L t j, h(Lyj^) and unobserved Uij, u(Uj(j)) 
for some scalar function v so that 

E[Y(z,g) | 

(10) Z = z,G = g,L = l,h(L) = h,U,v(V)] 

= E[Y(z,g)\L = l,h(L) = h,U,v(\J)}, 

but that (9) does not hold when we do not con¬ 
dition on Uij,v(XJ i (j')). Without data on U tJ causal 
effects are not identified. Let H = /i(Lj^)) and V = 
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Following the sensitivity analysis approach of 
VanderWeele and Arah (2011) for causal effects un¬ 
der no-interference, we express the difference be¬ 
tween the causal effect 

E[Y(z,g)\l,h}-E[Y(z',g')\l,h} 

= '52{ E i Y \ z ,9,hh,u,v\ 

U,V 

~ E[Y\z',g',l,h,u,v]} 

■ P(u, v\l, h) 

and the biased estimand 

E[Y\z,g,l,h]-E[Y\z',g',l,h} 

in terms of sensitivity analysis parameters. Let B = 
{E[Y\z, g, l, h] - E[Y\z ', g', l,h]}~ {E[Y(z, g)\l , h) - 
E[Y(z f , g')\l, h]} denote this difference. Technical de¬ 
velopment is given in the Appendix. 

Let u* and v* denote arbitrary reference values 
for U and V, respectively. Under assumption (10) 
we have that 

B = Y^{E(Y\z,g,l,h,u,v)-E(Y\z,g,l,h,u*,v*)} 

U,V 

■ { P(u,v\z,g,l,h ) -P(u,v\l,h)} 

(11) -^2{E(Y\z',g',l,h,u,v) 

U,V 

— E(Y\z',g',l,h,u* ,v*)} 

■ {P(u,v\z',g',l,h ) - P(u,v\l,h)}. 

To obtain the bias factor B, one could thus specify 
the effect of the unmeasured confounders U and V 
on the outcome, E(Y\z,g,l,h,u,v) — E(Y\z,g,l,h, 
u*,v*), for (Z,G) = ( z,g) and (Z,G) = ( z',g '), 
and also how the distribution of U and V differs 
when ( Z,G ) = (z,g) versus (Z,G) = ( z',g'), that is, 
P(u,v\z,g,l,h) and P(u,v\z', g' ,l,h). One can use 
these sensitivity analysis parameters to calculate the 
bias factor in (11) and then subtract the bias factor 
B from the estimate of the causal effect using the 
observed data E[Y\z,g,l,h] — E[Y\z',g',l,h] to ob¬ 
tain a corrected effect estimate for E\Y(z, q)\l,h] — 
E[Y(z',g')\l,h]. 

Note that the expression for the bias factor in 
( 11 ) makes no assumption beyond assumption ( 10 ) 
that control for observed ( L,H ) and unobserved 
(U, V ) would suffice to control for confounding 
of the effect of (Z, G ) on Y; it allows for mul¬ 
tiple unmeasured confounders. However, the use 


of the bias formula in ( 11 ) requires specifying a 
large number of parameters: E(Y\z,g,l,h,u,v) — 
E(Y\z,g,l,h,u*,v*) for every value of u,v and the 
distributions P(u,v\z,g,l,h) and P(u,v\z' ,g' ,l,h). 

Under some simplifying assumptions, expression 

(11) reduces to a much easier to use formula. In 
particular, suppose that there is a single unmea¬ 
sured confounder U and that V = w(Uj(j)) is scalar. 
Suppose also that the effects of U and V = u(Ujfj)) 
are additive in the sense that E(Y\z,g,l,h,u,v) — 
E(Y\z,g,l,h,u*,v*) = A (u — u*) + t(v — v*) for 
(Z,G) = (z,g) and ( Z,G) = ( z',g In the Appendix 
it is shown that under these assumptions 

B = \{E[U\z,g,l,h}-E[U\z',g',l,h}} 

( 12 ) 

+ T{E[V\z,g,l,h]-E[V\z',g',l,h}}. 

To use this simplified bias formula, one only needs 
to specify the effect, A, for a one unit increase 
in the unmeasured confounder U t j , the effect r 
of a one unit increase in the scalar functional of 
the unmeasured confounders of the other mem¬ 
bers of the group, u(Uj(j)), and how the means of 
Uij and v(UjQ-)) differ when (Z,G) = ( z,g ) versus 
when (Z,G) = {z' 1 g'). Once these sensitivity analy¬ 
sis parameters are specified the bias factor B can 
be calculated using formula ( 12 ) above and then 
B could be subtracted from the estimate of the 
causal effect using the observed data E[Y\z, g, l, h] — 
E[Y\z',g',l,h] to obtain a corrected effect estimate 
for E[Y(z,g)\l, h] — E[Y ( z',g')\l , h\. Under this sim¬ 
plified approach because the bias factor involves 
only the sensitivity analysis parameters and not 
the observed data, a corrected confidence inter¬ 
val could be obtained by subtracting B from both 
limits of a confidence interval for E[Y\z,g,l,h\ — 
E[Y\z', g', l, h\. A sensitivity analysis consists of re¬ 
porting the causal contrasts under a range of plau¬ 
sible values of r and A. The values r = A = 0 cor¬ 
respond to the assumption of no unmeasured con¬ 
founders. Selecting a plausible range of parameters 
could be done using subject matter expertise, by ex¬ 
ternal data from other studies or by including a very 
wide range of parameters that are thought to include 
those that would constitute very extreme values. Al¬ 
ternatively, one could examine the most important 
measured confounder and assess the magnitude of 
the corresponding parameters for the most impor¬ 
tant measured confounder; one could then consider 
whether an additional unmeasured confounder with 
parameters set equal to that of the most impor¬ 
tant measured confounder would substantially alter 
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results. This would allow an investigator to assess 
whether an unmeasured confounder would have to 
be stronger than the most important measured con- 
founder to substantially alter the results. 

Consider again the substantive example of Hong 
and Raudenbush (2006) described in Section 1.1.3. 
As noted above, Hong and Raudenbush (2006) ex¬ 
amined the effect of kindergarten retention on read¬ 
ing test scores allowing for interference by allow¬ 
ing the retention of other students at the school 
to affect a child’s reading test scores. They as¬ 
sumed that treatment assignment at both the school 
and the individual level was ignorable given a 
number of observed individual-level, school-level 
and school-aggregated-individual-level characteris¬ 
tics. Using a propensity score-based approach, they 
estimated, in the notation above, the contrasts 
E[Y(z = 1 ,g = 1 )]-E[Y(z = 0 ,g = 1)] and E[Y(z = 
1 ,g = 0)] — E[Y(z = 0,g = 0)], where g = 1 denotes 
high-retention school and g = 0 a low-retention 
school. They found that, in low-retention schools, 
their estimates indicated that the effect on reading 
scores of a student being retained versus being pro¬ 
moted was —8.18 (95% Cl: —10.02, —6.34), and in 
high retention schools the effect estimate was —8.86 
(95% Cl: —11.56, —6.16). A standard deviation in 
reading test scores in this sample is 13.48 points. 

Hong and Raudenbush also went through a sen¬ 
sitivity analysis argument for their results. They 
noted that the strongest predictor of current test 
scores were lagged test scores, but that it was un¬ 
likely that there was any unmeasured covariate that 
would predict their outcomes so strongly. They con¬ 
sidered instead whether unmeasured individual and 
school covariates that had effects on readings scores 
that were equal to those of the measured covari¬ 
ates with second strongest association with read¬ 
ing scores would suffice to explain away the effect 
estimates. Using an argument based on a formula 
similar to (12), they reported that unmeasured in¬ 
dividual and school confounders that had an effect 
as large as the second most important measured in¬ 
dividual and school level covariates would shift the 
estimate in high-retention schools to —4.25 (95% 
Cl: —6.95, —1.54) and thus not suffice to bring the 
confidence interval to include 0. However, in low- 
retention schools, unmeasured individual and school 
confounders that had an effect as large as the sec¬ 
ond most important measured covariates would shift 
the estimate and confidence interval in low-retention 
schools to —0.60 (95% Cl: —2.44, 1.24) and thus 


would suffice to bring their confidence interval for 
the effect in low-retention schools to include 0. The 
effects in high-retention schools seem more robust 
to the possibility of unmeasured confounding. In 
their analyses, Hong and Raudenbush (2006) used 
a similar expression to (12), but in their paper they 
did not provide a derivation of this formula and did 
not articulate the assumptions needed for the use of 
the formula. We have provided the derivations and 
assumptions required here. Moreover, we have also 
provided a more general expression, that in (11), 
that is applicable under much weaker assumptions. 

We have considered here a sensitivity analysis ap¬ 
proach for causal effects and spillover effects in the 
presence of interference. In other contexts, questions 
concerning whether the effect of a treatment on an 
outcome is mediated by some intermediate may be 
of interest. In settings in which mediation is of inter¬ 
est and interference occurs at the level of the media¬ 
tor so that the mediator for one unit may affect the 
outcomes for other units (cf. VanderWeele (2010a); 
VanderWeele et ah, 2013), a similar sensitivity anal¬ 
ysis approach for unmeasured confounding of the 
mediator and the outcome could be developed by 
applying and extending the results of VanderWeele 
(2010b) for direct and indirect effects from the no- 
interference setting to a setting with interference by 
following an analogous approach to that presented 
above. 


3.4 Sensitivity Analysis for Spillover Effects 
Under Unmeasured Confounding: 

Approach 2 

In this subsection we consider an alternative sen¬ 
sitivity analysis approach to assess the influence of 
unobserved confounding for direct and spillover ef¬ 
fects in general settings similar to those considered 
by Hong and Raudenbush (2006) described above. 
The following developments follow closely from anal¬ 
ogous sensitivity analysis techniques recently pro¬ 
posed in the context of mediation analysis (Tchet- 
gen Tchetgen (2011); Tchetgen Tchetgen and Sh- 
pitser (2012)). In order to formalize the approach, 
suppose that we wish to make inferences about the 
following causal effects: 

7 d (z,g,l,h) = E[Y(z,g) -Y(z 0 ,g)\z,g,l,h], 
l s (g,l,h) =E[Y(z 0 ,g ) - Y(z 0 ,g 0 )\g,l,h], 


where, unless stated otherwise, throughout G and 
H are left unrestricted, that is, G^j) = g( Z^) = 
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Z, r . H;u = h(hi(j)) = Lj/j). These effects are ver¬ 
sions, allowing for interference, of the so-called ef¬ 
fect of treatment on the treated, which have been 
studied extensively in the absence of interference 
by econometricians, epidemiologists and social sci¬ 
entists. The first contrast 7 d (z,g,l,h) captures the 
direct effect of Z tJ on Y^j conditional on the person’s 
observed exposure and the cluster’s observed 
data (ZIn contrast, 7 s (g,/,/i) is the 
spillover causal effect of Z,^ on Yij(zij = z$) within 

levels of (Z, ; //, ■,). Note that ^ d (zo,g,l,h) = 
7 s (go ,l,h) = 0, so that these effects are relative 
to the reference average potential outcome under 
(zo,9o)- 

Consider the pair of no unobserved confounding 
assumptions: 


E[Y{z 0 ,g)\z,g,l,h] = E[Y(z 0 ,g)\z 0 ,g,l,h ], 
E[Y(z 0 ,g 0 )\g,l,h\ = E[Y(z 0 ,g 0 )\g 0 ,l,h}. 

It is straightforward to show that under these as¬ 
sumptions, 7 d and j s are identified by 

7 d ’ f (z,g,l,h) =E[Y\z,g,l,h] 

~ E[Y\z 0 ,g,l,h], 

1 sA (gJ,h) =E[Y\z 0 ,g,l,h] 

~E[Y\z 0 ,go,l,h\. 


This shows that the above no unobserved con¬ 
founding assumption suffices to identify direct and 
spillover effects (on the treated) using a standard 
regression analysis approach to model E[Y\z, g, l, h]. 
Next, supposing that the no unobserved confound¬ 
ing assumption does not hold, we define the follow¬ 
ing selection bias functions: 

5 d (z,g,l,h) = E[Y(z 0 ,g)\z,g,l,h\ 

(13) 

- E[Y(z 0 ,g)\z 0 ,g,l,h\, 
$ s {g,l,h) = E[Y(z 0 ,g 0 )\g,l,h] 

(14) 

- E[Y(z 0 ,go)\go,l,h], 


where 5 d (zo, g,l,h) = 5 s (go, l, h) = 0. These selec¬ 
tion bias functions come about naturally upon con¬ 
trasting, on the additive scale, each of the obser¬ 
vational conditional association r y d ^(z,g,l,h) and 
7 s, '(g,l,h), with their corresponding causal analog, 
7 d (z 1 g, l, h ) and 7 s (g, l, h), respectively. To illustrate 
in the simple context of binary Z, one can verify that 


the confounding bias quantified on the additive scale 
is equal to 

7 d ’ t ( 2 = l,g,l,h)-') d {z= l,g,l,h) 

= E[Y\z = l,g,l,h}-E[Y\z 0 ,g,l,h] 
-E[Y(z = l,g)\z = l,g,l,h] 

+ E[Y(z 0 ,g)\z = 1 ,g,l,h\ 

= E[Y(z = l,g)\z = l,g,l,h\ 

- E[Y(zo,g)\z 0 ,g,l,h\ 

-E[Y(z = l,g)\z = l,g,l,h] 

+ E[Y(z 0 ,g)\z = 1 ,g,l,h] 

= E[Y(z 0 ,g)\z = 1 ,g,l,h\ 

- E[Y(z 0 ,g)\z 0 ,g,l,h\ 

= S d (z=l ,g,l,h), 

which makes clear the central role of the selec¬ 
tion bias function 6 d . A generalization of the above 
derivation gives similar expressions for <5 s (g, l, h ) and 
also extends beyond binary Z. Furthermore, this 
derivation also makes clear that the presence of con¬ 
founding implies that at least one of the following 
must hold: 

either S d (z,g,l,h)^ 0 for some (z,g,l,h) 

or 5 s (g,l,h) for some (g, l, h). 

The first condition implies "y d is not identified, while 
the second case implies 7 s is not identified. Thus, we 
may proceed as in Robins, Rotnitzky and Scharf- 
stein ( 2000 ) and recover causal inferences by as¬ 
suming the selection bias functions, 5 d (z,g , l, h ) and 
5 s (g,l,h), that encode the magnitude and direction 
of unmeasured confounding, are known. Suppose 
that higher values of Y are beneficial to one’s health. 
If 5 d (l,g,l,h) > 0, then, on average, an individual 
j in cluster i with {Gj(j) = g,Lij = l,H^ = h} 
and exposure value Z^ = 1 has higher potential 
outcomes Yij(zo = 0 ,g) than an individual in the 
same cluster and the same stratum {G«(j) = g,Tjj = 
= h} but unexposed Z t j = 0 , that is, health¬ 
ier individuals are more likely to receive the expo¬ 
sure conditional on the exposures of other people 
in the cluster and the observed confounders for the 
cluster. On the other hand, d d (l, g, l, h) < 0 suggests 
confounding by indication for exposure, that is, un- 
healthier individuals are more likely to be exposed. 
Likewise, if 5 s (g,l,h ) > 0 for all g / go indicates 
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that, on average, an individual in a cluster with con- 
founders {Lij = l, = h} and g{ Z i(i) ) = g* / g 0 
has higher potential outcomes Yij(zo = 0,go) than a 
comparable individual in a comparable cluster with 
baseline exposure value g( Z^) = go- In the special 
case where g(Z^) = = 0, one has that clus¬ 

ters with no exposed individual tend, on average, 
to be less healthy than clusters with one or more 
individuals exposed. 

The approach to inference in the presence of con¬ 
founding involves the following reparameterization 
of the conditional mean function E[Y\z,g,l,h] in 
terms of the causal contrasts 7 d and 7 s and the 
selection bias functions 5 d and 5 s . To state the 
reparameterization, suppose for the moment that 
f(z,g\l,h) = /(Zj = (z,g)\Li = ( l,h )) is known, then 
one can verify that for each unit in cluster i, 

E[Y\z,g, l, h] 

= 7 d (z,gJ,h) + 5 d (z,g,l,h) 

1 

~^ 2 ^ d {z,g,l,h)f(z\g,l,h) 

z =0 

+ 7 s (g,l,h)+ 6 s (g,l,h) 

]T S s (g(z*),l,h)f(z*\l,h) + q(l,h), 

z*e{o,i} n i _1 

where 

q(l,h) = E[Y(z 0 ,g 0 )\l,h}. 

For hxed 5 d and 5 s given by equations (13) and (14), 
the causal contrasts 7^ and 7 s are nonparametrically 
identified and can be estimated by fitting the above 
regression model. 

In practice, due to the high dimensionality of L, 
often encountered in applications, parametric mod¬ 
els must be used to reliably estimate 7 d (z,g,l,h), 
7 s ( 5 , l, h), f(z,g\l, h) and q(l,h). The above repa¬ 
rameterization is particularly advantageous in that 
it ensures variation independence of parameters of 
working models for these various quantities. A de¬ 
scription of parametric maximum likelihood and 
generalized estimating equations estimation is given 
in the Appendix. 

3.5 A Comparison of Sensitivity Analysis 
Techniques 

It is instructive to compare the sensitivity analysis 
techniques given in Sections 3.3 and 3.4. We begin 
by noting that the causal estimand targeted by the 


two methods differ. The first approach aims to make 
inferences about 

E[Y(z,g) -Y(z',g)\l,h\ and 

E[Y( Z ',g)-Y(z',g')\l,h], 

while the second approach targets the causal con¬ 
trasts 

E i Y (z,g) -Y(z',g)\z,g,l,h\ and 

E[Y(z\g)-Y(z',g')\g,l,h}. 

In the absence of confounding, the contrasts tar¬ 
geted by the two approaches coincide, but when un¬ 
measured confounding is present, the first approach 
gives direct and spillover effects for the subset of 
individuals with {l,h}, while the second approach 
delivers inferences about the direct effect for indi¬ 
viduals with {z,g, l,h} and the spillover effect for 
individuals with {g,l, h}, that is, interference effects 
of treatment on the treated. This distinction has im¬ 
plications for the corresponding sensitivity analysis 
techniques. Although both approaches require speci¬ 
fying unidentified parameters, in the first technique 
the parameters correspond to particular causal ef¬ 
fects (of U and V), whereas in the second tech¬ 
nique the parameters do not correspond to causal 
effects. More specifically, in the first approach, a 
parametrization of the bias expression ( 11 ) involves 
quantifying the causal interaction 

{E(Y\z,g,l,h,u,v) - E(Y\z,g,l,h,u*,v*)} 

- {E(Y\z',g',l,h,u,v) 
-E(Y\^,gf,l,h,u*,v*)} 

= {E(Y\z,g,l,h,u,v) - E(Y\z',g',l,h,u,v)} 

~ {E(Y\z,g,l,h,u*,v*) 
-E(Y\z',g[,l,h,u*,v*)} 

of the effect of ( Z,G ) within levels of (L, H,U,V). 
The first sensitivity analysis approach requires mak¬ 
ing a judgement not only about the nature of U 
(i.e., binary, polytomous, multivariate, etc.), and the 
magnitude and the direction of unmeasured con¬ 
founding, but also about the magnitude and direc¬ 
tion of effect heterogeneity on the additive scale. In 
contrast, the second sensitivity analysis technique 
directly quantifies the magnitude and direction of 
unmeasured confounding without making any ref¬ 
erence to a specific U and, therefore, it does not 
involve making any judgement about unidentified 
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causal effects. While making a judgement about the 
nature of U (i.e., binary, polytomous, multivariate, 
etc.) in the first approach could be difficult in prac¬ 
tice, the second approach, to ensure that posited 
models are compatible, requires that the user posit 
parametric models for f(z\g,l,h) and /(G = g\l,h), 
an additional modeling requirement not needed by 
the first approach. Both of these densities are, how¬ 
ever, nonparametrically identified from the observed 
data and, therefore, standard goodness-of-fit tools 
may be adopted to ensure a reasonable fit to the 
data. The first approach can be of particular use 
if subject matter expertise can help determine the 
nature of the unmeasured confounders and/or if 
prior analyses with other data for which these con- 
founders are available can be used to help inform 
the value of the sensitivity analysis parameters. 

The difference between the two techniques also 
has interesting but subtle implications if these tech¬ 
niques are used to construct tests of the sharp null 
of no treatment effects. If an investigator were in¬ 
terested in testing the sharp null of no treatment 
effects using the first sensitivity analysis approach, 
care would need to be taken to ensure that the spec¬ 
ification of the sensitivity analysis parameters was 
compatible with the sharp null, for example, by en¬ 
suring the interaction function in the above display 
is 0 as, for example, in the simplified expression in 
(12). In contrast, with the second sensitivity analysis 
approach, any specification of the sensitivity analy¬ 
sis parameters will be compatible with the sharp 
null, and so this issue of compatibility is not simi¬ 
larly a concern. 

4. DISCUSSION 

In this paper we have reviewed definitions and 
approaches to causal inference in the presence of 
interference. We have developed various sensitiv¬ 
ity analysis approaches when the causal effects and 
spillover effects of interest are unidentified either 
because randomization does not suffice for identifi¬ 
cation (Hudgens and Halloran (2006); VanderWeele 
and Tchetgen Tchetgen (2011b); Halloran and Hud¬ 
gens, 2012a, 2012b) or because of unmeasured con¬ 
founding in an observational study. 

We have extended existing sensitivity analysis ap¬ 
proaches (Robins, Rotnitzky and Scharfstein (2000); 
VanderWeele and Arah (2011)) from the setting of 
no-interference to allow for interference. Many set¬ 
tings in the social sciences in which causal effects 


and spillover effects are of interest are observational 
settings with interference and the results presented 
here will likely be useful in those settings. Further 
work could be done on sensitivity analysis for un¬ 
measured confounding in other settings in which 
there are not multiple independent clusters (e.g., 
Aronow and Sarnii (2013)) or in settings involv¬ 
ing the assessment of causal effects in social net¬ 
works (Christakis and Fowler (2007); VanderWeele 
( 2011 )). 

More generally, numerous further challenges re¬ 
main in the development of methods for causal in¬ 
ference under interference. Inference may become 
additionally challenging for treatment with multi¬ 
ple levels, as the number of combinations will in¬ 
crease dramatically. Interference patterns might also 
depend on the covariates of individuals in a clus¬ 
ter in complex ways. Furthermore, the approach for 
defining causal estimands for infectiousness where 
the covariates of individuals in the group are taken 
into account becomes unwieldly when the clusters 
are larger than two, posing an additional chal¬ 
lenge for future research. When dealing with cluster- 
randomized studies in which the clusters are large, 
the number of clusters may be small, making infer¬ 
ence difficult. In contrast, in the household studies, 
the number of clusters may be large, but the number 
in the households small. 

One of the limitations of the approaches for causal 
inference with interference presented in this paper 
is the assumption that there were fixed groups or 
blocks of individuals. One of the challenges for fu¬ 
ture research will be the issue of interference across 
groups. More generally, recent research on causal in¬ 
ference with interference has been relaxing the as¬ 
sumption of fixed groups. Aronow and Sarnii (2013) 
presented randomization-based methods for esti¬ 
mating average causal effects under arbitrary inter¬ 
ference of known form. Van der Laan (2012) incorpo¬ 
rates network information for each individual under 
consideration that describes the set of other individ¬ 
uals that each individual is potentially connected to. 
Inference is then driven by the number of individu¬ 
als rather than the number of communities, which 
in this case is one. Liu and Hudgens (2013) propose 
new generalized inverse probability weighted estima¬ 
tors of causal effects in the presence of any form of 
interference between individuals. Defining and com¬ 
paring causal estimands of effects across more gen¬ 
eral forms of interference will be challenging. Much 
more exciting research is left to be done on causal 
inference under general forms of interference. 
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APPENDIX 

Derivations for the Infectiousness Effect 


-E(Y\z',g[,l,h,u*,v*)} 

■ { P(u,v\z',g',l,h ) - P(u,v\l,h)}. 


VanderWeele and Tchetgen Tchetgen (2011b) 
showed that under monotonicity assumption 1 , p v = 
£[V i2 (l,0)|V a (l,0) = y-i(0,0) = 1] = E[Y i2 \Z n = 
l,Yn = 1] = pi and p u = £'[1^2 (0,0)|l^i(l,0) = 
*ii(0,0) = 1] = E[Y l2 \Z, A = 0 ,Y a = 1] + {E[Y i2 (0,0) | 
Z t \ = l,Yn = 1] — E[Y i2 (0,0)\Zii = 0, Yu = 1]} = 
pi + 9. From this it follows that p v —p u = (pi ~Po) — 
9 and p v /p u = Pi/(po + 9), Pv(l~Pu)/{Pu{l~Pv)} = 
PiO- -Po- 6)/{(po + 9)(1 - Pi)}, and 1 - p v /p u = 
l~Pi/(po + 9)- 

Hudgens and Halloran (2006) showed that under 
monotonicity assumption 1, p u = r yB/{ 1 + j(B — 
1)} and po = + Pu( 1 — V), where B = exp(/3), 

7 = P{Yi 2 (0,0)|lii(l,0) = 0,1*1 (0,0) = 1}, and V = 


( x ~ p(y«=i | §i=i) )- Solvin § Pu = r/B/{l+'y(B-l)} 
and po = 'yV + p u { 1 — V) to eliminate 7 gives p u = 
Po-Mi + Po-PnB-V) {B _ Qr 0 = poB _ 

Pu (-V - Bpo + po ~ B + VB) + (B — 1)(1 — V)p 2 u , 
a quadratic equation in p u . The roots of this equa¬ 
tion are ~ 4 —, where z = exp(/3)po, 6 = (1 — 

p(yn=i|zii=i) )( exp (^) - !} - exp(/3)p 0 +Po - exp(/3), 


and 


c = i e MP) ~ 11- 


Derivations for Sensitivity Analysis of Spillover 
Effect Under Unmeasured Confounding: 
Approach 1 


If in the notation of VanderWeele and Arah 
(2011), we let A, X and U be (Z,G), ( L,H ) and 
(U,V), respectively, and we let a\ and ao denote 
(. z,g ) and (z',g'), respectively, then by VanderWeele 
and Arah (2011) we have that 

B = E[Y\z,gJ,h}-E[Y\z , ,g',l,h} 

-E[Y(z,g)\l,h}-E[Y(z , ,g')\l,h} 

= E[Y\z,gJ,h\-E[Y\z',g',l,h] 

-^2{ E [ Y \z,g,l,h,u,v\ 

u,v 

- E[Y\z',g', l, h, u, v]}P(u, v\l, h ) 

= ^2{E(Y\z,g,l,h,u,v) - E(Y\z,g,l,h,u*,v*)} 

U,V 

■ {P(u,v\z,g,l,h) - P(u,v\l,h )} 
-^2{E(Y r \z',g',l,h, u ,v) 

U,V 


If E(Y\z,g,l,h,u,v) — E(Y\z,g,l,h,u*,v*) = A (u — 
u*) + T ( v — v*), then we have 

B = ^2{E(Y\z,g,l,h,u,v) - E(Y\z,g,l,h,u*,v*)} 

U,V 

• {P(u,v\z,g,l,h) - P(u,v\l,h )} 

-^2{E(Y\z', g' ,l,h,u,v) 

U,V 

— E(Y\z', g 1 , l, h,u*,v*)} 

■ { P(u,v\z',g',l,h ) - P(u,v\l,h)} 

= ^^{A(u — u*) + t{v — u*)} 

U,V 

• {P(u,v\z,g,l,h) - P(u, v\l, h)} 

— ^^{A(u — u*) + t(v — u*)} 

U,V 

■ { P(u,v\z',g',l,h ) - P(u,v\l,h)} 

= ^^(A u + tv)P(u, v\z, g, l, h ) 

U,V 

— y^ y (Au + tv) P (u, v\z' , g ', l , h) 

U,V 

= \{E[U\z,g,l,h]-E[U\z',g',l,h}} 

+ T{E[V\z,g,l,h}-E[V\z',g',l,h}}. 

Maximum Likelihood and GEE Estimation of 
Direct and Spillover Effects Under Unmeasured 
Confounding: Approach 2 

We briefly describe maximum likelihood and gen¬ 
eralized estimating equations inference for direct 
and spillover effects in the presence of unobserved 
confounding under Approach 2 . As stated in the 
text, in practice, due to the high dimensional¬ 
ity of L i often encountered in applications, para¬ 
metric models must be used to reliably estimate 
7 d 0, g , l, h), 7 s (p, l, h), /(Z^L;) and q(l, h). The pro¬ 
posed reparameterization of E[Y\z, g,l,h] is partic¬ 
ularly advantageous in that it ensures variation in¬ 
dependence of parameters of the working models for 
these various quantities, making possible a straight¬ 
forward application of maximum likelihood esti¬ 
mation. Specifically, consider the parametric mod¬ 
els 7 d (^,5 , ,^,^;V ;d ) 5 7 s (g,l,h;i’ s ), /(Zi|L»;a) and 
q(l,h;r) ); then, provided parameters are not shared 
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across working models, a particular choice of one of 
these models is guaranteed by our parametrization 
not to place any restriction on the other models. 
Maximum likelihood estimation of unknown param¬ 
eters requires that one posit an additional working 
model for the conditional density /(£j|Lj which we 
denote /(e^L^a;), for the vector of possibly cor¬ 
related residuals Y^— E[Y\z,g,l,h], j = l,...,rii. 
In principle, our choice of parametrization could be 
used in conjunction with standard techniques for 
modeling clustered outcomes, such as, for instance, 
by incorporating a random intercept to introduce 
correlation within a cluster in the regression model 
of Yij. Maximum likelihood estimation then pro¬ 
ceeds by maximization of 

N 

log n /( e < ^ V, a)I L i,z; w)/(Zi|L i; a) 

i=1 

with respect to (i/> d ,’tp s ,r],a,tjj). A simple alternative 
to maximum likelihood estimation entails finding a 
that maximizes the partial log-likelihood 


individuals in cluster i excluding person j: 

fi\d( z ,g,l,h) = X d z, S 8 £(g,l,h) = \*g, 
$x?{z,g,l,h) = z(Xf + X$g), 
d^(g,l,h)=g(Xt + X s 2 l), 

where for 6 and 6 d J , the scalar parameters X d and 
(Xf,X d ) encode the magnitude and direction of un¬ 
measured confounding for the effect of person j’s ex¬ 
posure Zij on his outcome Y t j, and for 5^} and , 
the scalar parameters A s and (Af,A|) encode the 
magnitude and direction of unmeasured confound¬ 
ing for the causal effect of the total number exposed 
Ylj'jtjZij 1 excluding person j, within the cluster i 
on person j’s outcome Yij. 

The functions and <^’ 3 2 model interactions be¬ 
tween and Ylj'^j Zij ', thus allowing for hetero¬ 
geneity in the selection bias function. Since the func¬ 
tional form of (S d d ,S^ s ) is not identified from the 
observed data, we generally recommend reporting 
results for a variety of functional forms. 


N 

logJJ/(Zi|L i; a) 

i=i 


and then finding the parameter value tp s , rj ) that 

solves the following generalized estimating equation 
with independence working correlation structure: 


E 


dEij(ip d , ip s , r), a) 


£ij$ d ,$ s ,g, a) = 0. 


Note that the dependence of on ( 5 d , 6 s ) has been 
suppressed in the notation used above; such depen¬ 
dence is made explicit in a sensitivity analysis which 
is obtained by repeating either maximum likelihood 
estimation or the estimating equations approach 
given above as (5 d ,5 s ) is varied within a finite set of 
user-specified functions T = { S r ! d . 5'^ s : X = ( X d , A s )} 
indexed by a finite dimensional parameter A with 
(<5 q,<5q) € T corresponding to the ignorability as¬ 
sumption, that is, Sq = Sq = 0. 

In applying this approach it is helpful to briefly 
describe possible functional forms for the selection 
bias functions 5 d ,5 s . In practice, it may be con¬ 
venient to specify simple parametric models for 
each of these functions as illustrated in the follow¬ 
ing display. To illustrate, one may set the function 
= J2j'^j Zij' to equal the number of exposed 
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